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Abstract 

We consider broadcasting a block of packets to multiple wireless receivers under random packet erasures 
using instantly decodable network coding (IDNC). The sender first broadcasts each packet uncoded once, then 
generates coded packets according to receivers’ feedback about their missing packets. We focus on strict IDNC 
(S-IDNC), where each coded packet includes at most one missing packet of every receiver. But we will also 
compare it with general IDNC (G-IDNC), where this condition is relaxed. We characterize two fundamental 
performance limits of S-IDNC: 1) the number of transmissions to complete the broadcast, and 2) the average 
delay for a receiver to decode a packet. We derive a closed-form expression for the expected minimum number 
of transmissions in terms of the number of packets and receivers and the erasure probability. We prove that it is 
NP-hard to minimize the decoding delay of S-IDNC. We also derive achievable upper bounds on the above two 
performance limits. We show that G-IDNC can outperform S-IDNC without packet erasures, but not necessarily 
with packet erasures. Next, we design optimal and heuristic S-IDNC transmission schemes and coding algorithms 
with full/intermittent receiver feedback. We present simulation results to corroborate the developed theory and 
compare with existing schemes. 

Index terms- Wireless broadcast, network coding, throughput, decoding delay, instantly decode. 

I. INTRODUCTION 

The broadcast nature of wireless medium allows one sender to simultaneously serve multiple receivers 
who are interested in the same data. We consider a block-based wireless broadcast system where a sender 
wishes to deliver a block of data packets to a set of receivers, with the channels between the sender and 
the receivers are subject to independent random packet erasures. A traditional approach is to send the 
data packets unaltered under a receiver feedback mechanism, such as Automatic-Repeat-reQuest (ARQ) 
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[1]. This approach, though simple, is inefficient in terms of throughput, as the transmitted packets are 
non-innovative to the receivers who have already received them. 

The advent of network coding [2] starts a new era for high throughput network coded wireless 
communications [3]—[17]. By linearly adding all data packets together with randomly chosen coefficients 
from a sufficiently large finite field, random linear network coding (RLNC) can almost surely achieve 
the optimal block completion time in block-based wireless broadcast [9]—[11], [18], which is defined 
as the number of transmissions it takes to complete the broadcast, and is a fundamental measure of 
throughput due to their inverse relation under a fixed block size. Compared to other optimal codes such 
as Fountain codes [19], [20], RLNC is preferred due to its ease of implementation at the sender and 
extension to more complex networks and traffic settings. 

However, with RLNC, data packets are block-decoded by solving a set of linear equations, which only 
takes place after a sufficient number of coded packets have been received. RLNC thus may suffer from 
heavy computational load [11] and packet decoding delay [17], which is measured by the average time 
it takes for a receiver to decode a data packet. The first issue can, for example, hinder the application 
of RLNC for mobile receivers with limited computational capability [11]. Meanwhile, a large packet 
decoding delay can be unacceptable for delay-sensitive applications such as video streaming [21], [22]. 

To mitigate these issues, instantly decodable network coding (IDNC) techniques [3], [ 12]—[ 15] have 
been introduced. With IDNC, the sender first broadcasts the data packets uncoded once. It then makes 
online coding decisions based on receivers’ feedback about their packet reception state, under the 
restriction that coding/decoding is over binary field. A simple packet reception state is demonstrated in 
Table I. There are two data packets, pi and p 2 , and three receivers, R\ to /?.>, where each has a subset 
of {pi, P 2 } an d wants the rest. Consider a coded packet of X = p! © p 2 , where © denotes binary 
XOR. It has three different effects on different receivers: 1) it is instantly decodable to Ri, because R\ 
can decode p 2 by performing X © pi; 2) it is non-instantly decodable to f? 2 , because f? 2 has neither 
Pi nor p 2 ; and 3) it is non-innovative to R 3 , because R 3 already has both p! and p 2 . 

There are two main types of IDNC techniques. The first one, called strict IDNC (S-IDNC) [13]—[15], 
prohibits the transmissions of non-instantly decodable packets to any receiver. Effectively, each coded 
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TABLE I 

A SIMPLE PACKET RECEPTION STATE FOR IDNC CODING 



Pi 

P2 

Ri 

has 

wants 

R'2 

wants 

wants 

Rs 

has 

has 


packet can include at most one wanted data packet of every receiver. The second one, called general 
IDNC (G-IDNC), removes this restriction to generate more coding opportunities. 

There is a large body of research on G-IDNC, focusing on its throughput and decoding delay 
performance, coding algorithms, and transmission schemes. Early models and heuristics related to G- 
IDNC were proposed for index coding [23]. Then G-IDNC was introduced for wireless broadcast 
and was graphically modeled in [12]. Although its best performance remains unidentified, powerful 
heuristic algorithms have been developed to improve its throughput and/or decoding delay [12], [24]- 
[26], or to strike a balance between the two [27]. G-IDNC transmission schemes under full/intermittent 
receiver feedback have been developed [28], [29]. G-IDNC has also been adopted in wireless broadcast 
applications with hard decoding deadlines [21] or with receiver cooperation [30]-[32]. 

Studies on theoretical performance characterization and implementations of S-IDNC are more limited 
in both range and depth. S-IDNC was graphically modeled in [13], which then proved that the minimum 
clique partition solution [13] of the associated graph can be an S-IDNC solution that minimizes the block 
completion time. However, this solution does not take into account the issues of decoding delay and 
the robustness of coded transmissions to erasures. S-IDNC has shown to be asymptotically throughput 
optimal when there are up to three receivers or when the number of data packets approaches infinity 
[21], but the general relation between the throughput of S-IDNC and system parameters has not been 
characterized before. In addition and to the best of our knowledge, the minimum packet decoding delay 
of S-IDNC is still unknown. Moreover, there have not been S-IDNC transmission schemes that can 
work with intermittent receiver feedback. Another unaddressed problem is a systematic performance 
comparison between S-IDNC and G-IDNC. 
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In this paper, we study the above problems and provide the following contributions: 

1) We characterize the throughput performance limits of S-IDNC. Specifically, we first derive an 
achievable upper bound on the minimum block completion time for any given packet reception 
state. We then derive a closed-form expression for the expected minimum block completion time 
in terms of the number of packets and receivers and the erasure probability; 

2) We prove that it is NP-hard to minimize the packet decoding delay of S-IDNC. We derive an upper 
bound on the minimum packet decoding delay in terms of the minimum block completion time; 

3) We show that in the presence of erasures, the minimum clique partition solution of the S-IDNC 
graph as identified by [13] may not result in the minimum block completion time, because it 
does not allow the same data packet to be repeated in different coded packets, a desired property 
which we refer to as packet diversity. Motivated by this fact, we develop the optimal S-IDNC 
coding algorithm, as well as heuristics that aim to improve packet diversity. We design S-IDNC 
transmission schemes under full and intermittent receiver feedback; 

4) We also provide new results on how S-IDNC and G-IDNC compare. We study the relation be¬ 
tween S-IDNC and G-IDNC graphs, and then demonstrate some scenarios under which G-IDNC 
can/cannot outperform S-IDNC. 

II. System Model and Notations 

A. Transmission Setup 

We consider a block-based wireless broadcast scenario, in which the sender needs to deliver a block 
of K data packets, denoted by Vk = {P/.ltLi - to N receivers, denoted by 1Z N = {R n }n=i through 
wireless channels that are subject to independent random packet erasures. 

Initially, the K data packets are transmitted uncoded once using K time slots, constituting a systematic 
transmission phase [11]. Then, each receiver provides feedback 1 to the sender about the packets it has 
received or missed due to packet erasures. The complete packet reception state is represented by an 
N x K state feedback matrix (SFM) A, where a n j { . = 1 if R n has missed (and thus still wants) p k . and 

*We assume that there exists an error-free feedback link from each receiver to the sender that can be used with appropriate frequency. 
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(a) State feedback matrix A 


(b) S-Graph Q s 


Fig. 1. An example of SFM and its S- and G-IDNC graphs. 


(c) G-Graph Q g 


(in.k — 0 if R n has already received p/, : . The set of data packets wanted by R n is called the Wants set 
of R n and is denoted by W n . The set of receivers who want p/, is called the Target set of p k and is 
denoted by T k . The size of T k is denoted by T k . Packets with larger T k are more desired by receivers. 

Example 1. Consider the SFM in Fig. 1(a) with K = 6 data packets and N = 5 receivers. The Wants 
set of R\ is Wi = {pi,P 5 ,P 6 }- The Target set of p 3 is % = and thus T 3 = 2. 


B. Coded Transmission Phase: Two Types of 1DNC 

According to A, the sender generates IDNC coded packets under the binary field F 2 . Explicitly, IDNC 
coded packets are of the form X = 0 pfcgA/( P k, where Ad is a selected subset of Vk, and is called an 
IDNC coding set. Obviously, X has three possible decoding effects at each receiver: 

Definition 1.1. An IDNC coded packet X is instantly decodable for receiver R n if A4 contains only 
one data packet from the Wants set W n of R n , i.e., if \M. D W n = 1. 

Definition 1.2. An IDNC coded packet X is non-instantly decodable for receiver R n if A4 contains 
two or more data packets from the Wants set VV„ of R n , , i.e., if |Ad fl W„| > 1. 

Definition 1.3. An IDNC coded packet X is non-innovative for receiver R n if A4 contains no data 
packets from the Wants set W n of R n , , i.e., if \M. ft W n | = 0. Otherwise, it is innovative. 

Depending on which of the above three types of effects are allowed, there are two variations of IDNC. 
The first one is called strict IDNC (S-IDNC), which is the main subject of our study. It prohibits the 
transmission of any non-instantly decodable coded packets to any receivers. This restriction implies that 
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any two data packets wanted by the same receiver cannot be coded together. We thus have the concept 
of conflicting and non-conflicting data packets: 

Definition 2. Two data packets p * and p ? conflict if at least one receiver wants both of them, i.e., if 
3 n : {pi,Pj} C W n . Otherwise they do no conflict. 

An S-IDNC coding set is thus a set of pairwise non-conflicting data packets. The conflicting states 
among data packets can be represented by an undirected graph Q S (V,£). Each vertex v, e V represents 
a data packet p*. Two vertices v, and v 3 are connected by an edge e h] e £ if p, and p , do not conflict. 
Thus, every complete subgraph of Q s , a.k.a., a clique, represents an S-IDNC coding set. In the rest of 
the paper, we will use the terms “coded packet”, “coding set”, and “clique” interchangeably, and denote 
the last two by At. 

The main limitation of S-IDNC is that a coded packet which is instantly decodable for a large subset of 
receivers may be prohibited merely because it is non-instantly decodable for a small subset of receivers. 
In the second type of IDNC, called general IDNC (G-IDNC), the restriction on non-instantly decodable 
packets is removed to generate more coding opportunities. 

G-IDNC can also be graphically modeled [24]. The difference is that, in the G-IDNC graph Q g (V, £), 
a data packet p /. wanted by different receivers are individually represented by different vertices v n j., 
for all « nfe = 1. Consequently, the number of vertices in Q g is equal to the number of “l”s in A. Two 
vertices v m)l and v n j are connected by an edge if: 1) i = j, or 2) if p, f W n and p 3 f W m . In 
the first case, p, = p ; . and thus by sending p, both R m and R n can decode. In the second case, by 
sending p, © p ; , R rn and R n can decode p, and p ; , respectively, because they already have p ; and p ; . 
respectively. Similar to S-IDNC, every clique of Q g represents a G-IDNC coding set. 

We note that an S-IDNC coded packet is always a G-IDNC coded packet, but the reverse is not 
necessarily true. Below is an example of S- and G-IDNC coded packets. 

Example 2. Consider the SFM and its S- and G-IDNC graphs in Fig. 1. The G-IDNC graph indicates 
that (v ]A , v i 4 ) is a clique. The corresponding G-IDNC coding set is (pi,P 3 ,P 4 ), and thus X g = 
Pi © p .3 © Pi is a G-IDNC coded packet. X y is instantly decodable for R 1 , R 4 , R 5 because they only 
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want one data packet from X g . X g is non-instantly decodable for R : > because R-\ wants both p :! and 
P4- X g is non-innovative for R>. 

Due to the existence of R 3 , X g is not an S-IDNC coded packet. Whereas the S-IDNC graph indicates 
that (v \. v-2- v : > ) is a clique. The corresponding coding set is (pi, P 2 , P 3 ), and thus X s = Pi © P 2 © P 3 
is an S-IDNC coded packet, which can be verified to cdso correspond to clique {v l A . v 2} 2 , v 3i3 ) or clique 
(^1,1, 2, ^5,3) in the G-IDNC graph. 

We then introduce the notion of IDNC solution. A set of IDNC coding sets is called an IDNC solution 
if, upon the reception of the coded packets of all these coding sets, every receiver can decode all its 
wanted data packets. An S-IDNC solution is denoted by S s . The set of all S-IDNC solutions of a given 
SFM is denoted by § s . Similarly, we can also define S g and § 9 for G-IDNC. 

For the SFM in Fig. 1, by partitioning the S-IDNC graph into three disjoint cliques, we can obtain, 
among others, an S-IDNC solution of S s = {(pi, P4), (P2, Ps), (P 3 , P 6 )}- Similarly, a disjoint clique par¬ 
tition of the G-IDNC graph is {(^1,1, w 2 ) 2, v 5i3 , v 4A ), (v 3 : 3 ,v 1 ) 6 ,v 2 fi ,v 4fi ), (vi >5 , u 3 , 5 , v 5) 5 ), (v 3 ,4, %)}. 
indicating a G-IDNC solution of S g = {(pi, p 2 , p 3 , Pi), (P3,Pe), (Ps), (P4)}. 

To assess the performance of IDNC solutions, we now introduce our measures of throughput and 
decoding delay. 

C. Throughput and Decoding Delay Measures 

An S-IDNC solution S s requires a minimum of |«S s | coded transmissions. We call Us s — S s \ the 
minimum block completion time of S s . It measures the best throughput of S s with a value of K + Us 
packet per transmission. We further denote by U s the absolute minimum block completion time over all 
the S-IDNC solutions of A, i.e., U s = min{T/s s : S s e S s }. The definition of U g for G-IDNC follows. 

We measure decoding delay by average packet decoding delay (APDD) D, which is the average time 
it takes for a receiver to decode a data packet: 

I 

D = — ^ ^ tt n ,k ( 1 ) 

^Q"n,k = 1 

where u n ^ is the time index when R n decodes p/.,, and T = Ylk=i Tk, which is also the number of 
“l”s in A. Given an IDNC solution S, by letting u n j. be the first time index when S allows R n to 



TABLE II 


The decoding delay of original data packets at the receivers 



Pi 

P2 

P3 

P4 

Ps 

P6 

Ri 

l 

0 

0 

0 

4 

2 

R2 

0 

1 

0 

0 

0 

2 

r 3 

0 

0 

2 

3 

4 

0 

i?4 

0 

0 

0 

3 

0 

2 

Rs 

0 

0 

2 

0 

4 

0 


decode p^, (1) produces the minimum APDD of S. Then similar to U s (resp. U g ), we denote by D s 
(resp. D g ) the absolute minimum APDD over all S- (resp. G-) IDNC solutions of A. We also note that 
in the specific case of an S-IDNC solution S s , u n ^ is indeed the index of the first coding set in S s that 
contains p fc , as every receiver who wants p k can decode it from this coding set. 

Example 3. Consider the SFM in Fig. 1(a). Suppose that an S-IDNC solution with four coded packets 
X { = pi © P 2 , X 2 = P 3 © Pb, X 3 = p 4 , and X 4 = p 5 are transmitted in this order. The receivers’ 
decoding time { u n ^} are summarized in Table 11. The minimum APDD of this solution is D$ s = 
(Ix2 + 2x5 + 3x2 + 4x3)/12 = 2.5. 

In the presence of packet erasures, the sender needs to adopt a coded transmission phase. In each 
coded transmission, it selects and broadcasts a coding set through erasure-prone wireless channels. We 
denote by Ut the block completion time of this phase, and by D T the APDD of this phase, calculated as 
in (1). Ut and D T measure the throughput and decoding delay performance of this phase, respectively. 
They vary according to the IDNC solutions, transmission schemes, and erasure patterns. But it always 
holds that Ut f U s and I)j F D s if S-IDNC is applied. Therefore, U s and D s reflect the performance 
limits of S-IDNC. Hence, we will first study these limits in the next section, and then design S-IDNC 
transmission schemes and coding algorithms in Sections IV and V, respectively. 

III. Performance Limits and Properties of IDNC 


In this section, we study performance limits and properties of S-IDNC and compare it with G-IDNC. 
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A. Absolute minimum block completion time U s 

We first study the throughput limit of S-IDNC, measured by the absolute minimum block completion 
time U s . It has been proved that U s is equal to the size of the minimum clique partition solution 2 of Q s 
[13], denoted by S c . This equivalence holds because of the following property: 

Property 1. Removing any vertex from the S-IDNC graph does not change the connectivity of the 
remaining vertices. 

This property holds because vertices in Q s represent different data packets. Thus, to remove all vertices 
from Q s (i.e., to complete the broadcast), at least <S r cliques must be removed, which yields U s = 6V |. 

According to graph theory, |«S C | is equal to the chromatic number 3 x(&s) of the complementary graph 
Q s , which has the same vertex set as Q s , but has opposite vertex connectivity. We thus have U s = x(QQ)- 
This equality enables us to answer two important questions about U s : 1) how to find the U s of a given 
S-IDNC graph? 2) what are the statistical characteristics of U s under random packet erasures? 

1) The U s of an SFM: The chromatic number of a graph (and thus U s ) has been proven to be NP-hard 
to find and AXP-hard to approximate [33]. But there are heuristic algorithms and bounds for it. We will 
develop algorithms dedicated to S-IDNC in Section V, and focus on the bounds in this subsection. 

Tight bounds on x(9s) exist, but are also NP-hard to find. One such example is a tight lower bound 
w(Q s ) [34], the size of the largest clique of Q s . There are also loose bounds. For example: 

Property 2. All S-IDNC graphs with K vertices and M 0 edges have: 

U s > \K 2 /(K + 2M 0 )~| (2) 

where \x\ outputs the smallest integer greater than x. 

This bound is due to Geller [35] and its proof is omitted here. This bound is useful because it identifies 
the smallest achievable U s of all the S-IDNC graphs with K vertices and M 0 edges. 

“The minimum clique partition solution of a graph Q is the minimum set of disjoint cliques of Q that together cover all the vertices. 
’The chromatic number of a graph Q is the minimum number of colors to color the vertices so that any two connected vertices have 


different colors. 
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An existing upper bound on x(Ss) is A(f? s ) + 1 [33], where A(£? s ) is the largest number of edges 
incident to any single vertex in Q s . However, with given K and M 0 , this bound is not always achievable. 
To see this, assume Q s has K — 1 edges. Connecting these edges to the same vertex yields an upper 
bound of A (Q s ) + 1 = K. But no matter how we allocate these edges, there are always unconnected 
vertices in Q s , which indicates that x(G s ) < K. Hence, the upper bound is not achievable here. 

We are thus motivated to derive an achievable upper bound of U s as a function of K and M 0 . We 
first note that a set of pairwise unconnected vertices (a.k.a. an independent set) of Q s , denoted by V/, 
must be transmitted separately because their corresponding packets all conflict with each other. The size 
of V/ is K when there is no edge in Q s , indicating that U s — |V/| = K. Then, whenever a new edge 
is added to the graph, we can maximize U s by maximizing |V/|, which means that the edge should not 
connect two vertices in V/ whenever possible. Explicitly, our upper bound on U s is derived iteratively: 

. When there is no edge in the graph, we have U s = K; 

• When there are M 0 = [1 ,K — 1] edges, we can use them to connect V\ with v 2 ■■■Vk- Since 
Vo ■ ■ ■ v K are independent, we have U s = K — 1; 

• When there are up to K — 2 additional edges, i.e, when M 0 = [K,2K — 3], we can use these 
additional edges to connect v 2 with v 3 ■ ■ ■ v K . Since v : > ■ ■ ■ v K are independent, we have U s = K— 2; 

. The iterations will terminate when the graph is complete, i.e., when M 0 = K(K — 1)/2 and U s = 1. 

It can be easily proved that any reallocation of edges will reduce the U s derived above to a smaller 
value. Our upper bound on U s has the following stair-case profile: 


Property 3. All S-IDNC graphs with K vertices and M 0 edges have: 


K, 

M 0 

K — 1, 

M 0 

K- 2, 

M 0 

1, 

M 0 


M 0 = K(K - l)/2 


( 3 ) 
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2) U s as a function of system parameters: In addition to finding U s for a given S-IDNC graph, we 
are also interested in the statistical characteristics of U s , for which we assume that A (and thus Q s ) is 
obtained as a consequence of random packet erasures in the systematic transmission phase. 

For wireless broadcast, a common assumption on random packet erasures is that they are i.i.d. random 
variables with a probability of P e . Under this assumption, a similar question has already been introduced 
and answered for the RLNC technique. It has been shown in [6], [36], [37] that the block completion 
time of RLNC scales as (7(ln(iV)) when K is a constant. Consequently, the throughput of RLNC 
vanishes with increasing number of receivers N. To prevent zero throughput, it has been proved in [38] 
that K should scale faster than In (TV). 

Since the throughput of RLNC is optimal, it cannot be exceeded by the throughput of S-IDNC. Hence, 
we can infer that the throughput of S-IDNC should also follow a vanishing behavior with increasing 
N. However, its rate and specific dependence on system parameters have not been fully characterized 
in the literature. In this subsection, we answer this question through the following theorem: 


Theorem 1. The mean of the absolute minimum block completion time U s is a function of the block 
size K, the number of receivers N, and packet erasure probability P e as follows: 


E[U S \ 




logtf(l ~ P e) 


■N 


— c{K, P e ) ■ N 


(4) 

(5) 


where o(l) is a small term that approaches zero with increasing K. 

Proof: Our approach is to model the complementary S-IDNC graph Q s as a random graph with 
i.i.d. edge generating probability. Recall that two vertices in Q s are connected if the two data packets 
conflict, i.e., if at least one receiver has missed both packets. Therefore, the edge generating probability, 
denoted by P c , is calculated as: 

Pc = 1 - (1 - P?) N , ( 6 ) 


Then, the key is to prove that different edges are generated independently. We first consider the 
independency between two adjacent edges, say e l:2 and ei 3 , which share V\. The information carried 
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Fig. 2. The mean and simulated number of edges in Q a when P e = 0.2 and K € [20,100]. 

by e\ (resp. e 2 ) is that there is at least one receiver who wants both pi and p 2 (resp. pi and p 3 ). Hence, 
the mutual information between e j and e 2 is that p! is wanted by at least one receiver, which happens 
with a probability of 1 — (1 — P e ) N . Thus: 

/(e ll2 ; e 1)3 ) < H((l - P e ) N , 1 - (1 - P e ) N ) = H ( Pi ) (7) 

The inequality holds because other edges incident to p 3 also contribute to // (p |). It is clear that 
J(ei j2 ; ei. 3 ) quickly converges to zero with increasing N, indicating that e l 2 and ei >3 are asymptotically 
independent of each other. We note that two disjoint edges in Q s share no mutual information, and thus 
are mutually independent. Therefore, we can assume that edges in Q s are independently generated. 

Consequently, Q s can be modeled as an Erdos-Renyi random graph [39], which has K vertices and 
i.i.d. edge generating probability of P c . Fig. 2 compares the mean number of edges (with a value of 
K ( K — l)/2 • P c ) of our proposed random graph model and the simulated average number of edges 
in Q s . Our model shows virtually no deviation under all considered values of N and K. From graph 
theory, given K and P c , almost every random graph Q s has a chromatic number of: 

(8) 

Since U s = x(G s ), the above value is the mean of U s . By substituting (6) into (8) we obtain (4). ■ 

Theorem 1 has the following important corollary: 
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Corollary 1. The mean of the absolute minimum block completion time, E[U S ], of S-IDNC increases 
linearly with the number of receivers in wireless broadcast with i.i.d. random packet erasures. 

Then, by noting that the mean block completion time of the coded transmission phase is lower 
bounded by E[U S \, we conclude that the throughput of S-IDNC is significantly affected by the number 
of receivers. S-IDNC may not be a good choice when the number of receivers is large. We note that 
the above theorem and corollary are not directly applicable to G-IDNC, because the edge generating 
probability in G-IDNC is quite different. Interested readers are referred to [40] for more information. 

B. Absolute minimum average packet decoding delay D s 

Unlike U s , to the best of our knowledge there is no existing hardness result on finding D s . In this 
subsection, we address it through the following theorem and then propose an upper bound on D s . 

Theorem 2. It is NP-hard to find the absolute minimum APDD D s of S-IDNC. 

Proof: In order to prove it, we introduce the concept of perfect S-IDNC solution : 

Definition 3. An S-IDNC solution is perfect and is denoted by S p if every receiver R n can decode one 
of its |W n | wanted data packets from each of the first VV n coding sets in S p . 

This definition implies that S p , if it exists, offers the perfect packet decoding. Hence, its minimum 
APDD Ds p is not only the absolute minimum APDD D s of S-IDNC, but also a lower bound D of the 
minimum APDD of all other linear network coding techniques. Moreover, it can be easily shown that 
this lower bound can only be achieved by S p . 

It has been proved in [41] that it is NP-hard to determine the achievability of D_. Hence, it is NP-hard 
to determine the existence of S p . Then by contradiction, if it is not NP-hard to find D s , we can easily 
determine the existence of S p by comparing D s with D Sp . Therefore, it is NP-hard to find D s . ■ 
Besides the NP-hardness, D s has the following property: 

Property 4. The absolute minimum APDD D s is upper bounded by U s as 


2 


( 9 ) 
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Proof: Given an S-IDNC solution S s = {M u }u=ii let T(u) = J2 Pk £Mu^ k num ber of 

receivers who can decode a data packet from Ai u . The minimum APDD of S s is thus: 

1 u 

Ds, = f E T W'“ (10) 

u= 1 

which is maximized when {T(u)}% =1 = '{?■ In this case, D Ss = . Applying this result to an S-IDNC 

solution with absolute minimum block completion time U = U s , we obtain the result. ■ 

Our proof indicates that, although it is NP-hard to achieve D s , we can still effectively reduce APDD 
by reducing the S-IDNC solution size. Before we further explore this result to implement S-IDNC, we 
would like to compare the performance limits of S-IDNC that we have just derived with G-IDNC. 

C. S-IDNC vs. G-IDNC 

In this subsection, we address the question of how does S-IDNC compare with G-IDNC? 

We first note that the NP-hardness of finding D s also holds for D g . This is because the perfect 
S-IDNC solution S p is also the best possible G-IDNC solution. For the throughput, we first present a 
relation between S- and G-IDNC graphs (proved in the appendix): 

Theorem 3. The minimum clique partition solutions of S-IDNC and G-IDNC graphs have the same 
size. In other words, x{Ss) = x{Gg)- 

However, the above theorem does not imply U s = U g . This is because G-IDNC does not have Property 
1. Explicitly, by removing a vertex from Q g , more edges and larger cliques may be generated, and thus 
the absolute minimum block completion time U g can be smaller than x(0g) °f the original G-IDNC 
graph Q g [25]. We thus have U g f U s . We note, however, that a systematic way of finding U g other 
than brute-force search remains widely open. 

Therefore, when there are no packet erasures, the throughput of G-IDNC is at least as good as S- 
IDNC. But is this still true in more realistic erasure-prone scenarios? In the next section, we will design 
S-IDNC transmission schemes under packet erasures and compare them with G-IDNC. We will apply 
the above theorem to show that G-IDNC cannot outperform S-IDNC under certain circumstances. 
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IV. S-IDNC Transmission Schemes 

In this section, we design S-IDNC transmission schemes to compensate for packet erasures in the 
coded transmission phase, which are i.i.d. with a probability of P e . To this end, the sender has to 
regularly collect feedback from the receivers about their packet reception state to make online coding 
decisions. We consider two common types of feedback frequency, namely: 

1) fully-online feedback: feedback is collected after every coded transmission. However, this could 
be costly in wireless communications. We thus also consider a reduced feedback frequency next; 

2) semi-online feedback: feedback is only collected after several (to be quantified later) coded trans¬ 
missions; 

To be able to design S-IDNC transmission schemes, two questions need to be answered first: 

1) What is the optimization objective for throughput and decoding delay improvement? 

2) What does the sender need to send to achieve it? 

Before addressing these questions, we first highlight some challenges: 

Remark 1. Under random packet erasures, a reasonable measure of throughput is the mean block 
completion time E[Ut] of the coded transmission phase. However, it is intractable to minimize E\U T \. 
To see this, let us consider the stochastic shortest path (SSP) method [24]. In SSP method, the state 
space comprises the current SFM and its successors, and thus has a prohibitively large size with a value 
of 2 T , where T is the number of “1 ”s in A. The action space for each state comprises all cliques/coding 
sets, which is NP-hard to find [42]. Then, E\Uf\ is recursively minimized by examining all the states 
and the associated actions. Such examination is necessary, because the packet erasures can take any 
pattern and are not predictable. But it makes E[Ut\ intractable to minimize. To overcome this difficulty, 
we will turn to optimization objectives that are heuristic, but still based on SSP optimization principles. 

Remark 2. It is intractable to minimize the APDD Dt of the coded transmission phase due to the 
NP-hardness of finding D s , because otherwise by setting P e = 0, the minimum D T is equal to D s . To 
overcome this difficulty, we will give higher priority to the minimization of block completion time. In 
other words, we first minimize the block completion time. Then among the resultant coding decisions, 
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we choose the one that minimizes the decoding delay. Our prioritization reflects the motivation of using 
network coding, that is, to achieve better throughput performance. It also provides bounded decoding 
delay performance as we have shown in (9). This will also be confirmed by our simulations, which 
show that D t generally decreases with decreasing Ut- 

A. Fully-online Transmission Scheme 

With fully-online feedback frequency, the sender only transmits one coded packet before collecting 
feedback. Under the SSP method, the current state is the current SFM A, the absorbing state is the 
all-zero SFM and is denoted by A 0 . The action space comprises all the S-IDNC coding sets of A. The 
cost of each action is one, for it consumes one transmission. The block completion time Ut is thus 
equal to the number of transitions (a.k.a. path length or distance) between A and A 0 . 

According to Remark 1, it is intractable to choose an action/coded packet that minimizes the expected 
path length (and thus E\U T ]). As a heuristic alternative, we propose to choose an action/coded packet 
that belongs to the shortest path from A to A 0 , which has a length of U s . This choice guarantees 
that, upon the reception of the coded packet at all interested receivers, the shortest distance between 
the updated state A' and A 0 is minimized to U s — 1. To this end, the coded packet must belong to a 
minimum clique partition solution S c . Otherwise, the shortest distance between A' and A 0 is still U s . 

We then reduce APDD by forcing the coded packet to be maximal (and thus serving the maximal 
number of receivers). However, cliques in a minimum clique partition solution are not necessarily 
maximal. Hence, we further require the coded packet to belong to a set of U s maximal cliques that 
together cover all the data packets. This set is also an S-IDNC solution and is denoted by S m . 

In conclusion, we propose the following coded packet M ./ for fully-online transmission scheme: 

Given an SFM instance, the preferred coded packet A4 j is the most wanted coded packet in S rn , 
where S m is an S-IDNC solution that contains U s maximal cliques. 

B. Semi-online Transmission Scheme 

The fully-online transmission scheme is costly, not only in collecting feedback, but also in computa¬ 
tional load, as it has to find S m in every time slot. These problems can be mitigated by partitioning the 
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coded transmission phase into rounds. In each round, the sender transmits a complete S-IDNC solution 
and only collects feedback at the end of each round. We call this scheme the semi-online scheme. 

Under the SSP method, the action space is the set of all S-IDNC solutions § s , and the cost of each 
action is the solution size |«S s |, which is equal to the length of a semi-online transmission round. The 
total cost is thus equal to the block completion time. 

According to Remark 1, it is intractable to minimize the expected total cost (and thus E[Ut})- As 
a heuristic alternative, we propose to minimize the expected cost of the shortest path between A and 
Aq. The shortest path has a length of one, representing the event that every coded packet of the chosen 
solution S s is received by all the interested receivers after only one semi-online round. Denote the 
probability of this event by P s . Then the expected cost is \S s \/P s , where P s is calculated as: 

K 

Ps = Y[(l~P e dk ) Tk ( 11 ) 

k =1 

Here c4 is called the packet diversity and is defined below. 

Definition 4. The diversity df- of data packet p/, is the number of coding sets in S s that comprise p/,. 

We note that the minimum clique partition solution S c is not a preferred semi-online S-IDNC solution. 
Although S c offers the smallest solution size (|«S C | = U s ), it does not maximize P s because every data 
packet has a diversity of only one due to disjoint cliques in S c . Instead, the S m we have proposed for 
the fully-online case can offer a higher P s than S c due to possibly overlapping maximal cliques, while 
also offering the smallest solution size. 

We still wish to answer the following question before choosing S m as our preferred semi-online 
S-IDNC solution: Is there a solution that, though large in its size, provides higher packet diversities, 
so that P s is maximized? 

An explicit answer to this question is difficult to obtain, because it requires the examination of all 
the solutions of size greater than IJ S . Such search is costly and does not provide any insight into this 
question. Moreover, a solution with a larger block completion time is unlikely to provide higher packet 
diversities due to the following property of S-IDNC solutions: 


Property 5. Every coding set in an S-IDNC solution comprises at least one data packet with a diversity 
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Fig. 3. The fully- and semi-online transmission schemes. 

of one. 

This property holds because if every data packet in a coding set has a diversity of greater than one, 
then this coding set can be removed from the solution without affecting the completeness of the solution. 
Due to the above property, an S-IDNC solution S s has at least S s data packets with a diversity of only 
one. According to (11), these unit-diversity data packets reduce P s the most. 

Therefore, we choose S m for throughput improvement. Then, by taking into account our secondary 
optimization objective, i.e., the APDD, we define our preferred semi-online S-IDNC solution as follows: 

Given an SFM instance, the proposed semi-online S-IDNC solution is S rn , which comprises a set of 
U s maximal cliques. The cliques are sorted for transmission in the descending order of their numbers 
of targeted receivers to minimize the APDD. 

A flow-chart of the proposed two transmission schemes are presented in Fig. 3. Both the fully- and 
semi-online IDNC schemes require finding S m . Since packet diversity is not a concern in graph theory, 
there is no algorithms to find S m in the graph theory literature. Hence, we will design algorithms 
dedicated for S-IDNC in the next section. Before doing so, however, we briefly compare S-IDNC and 
G-IDNC under the above two transmission schemes. 

C. S-IDNC vs. G-IDNC 

With fully-online feedback, the sender can update the G-IDNC graph and add new edges representing 
coding opportunities after every transmission. The throughput of G-IDNC is thus better than S-IDNC. 
But the price is high computational load, because G-IDNC graph is much larger than S-IDNC graph 
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( 0(NK ) v.s. K). However, during a semi-online transmission round, the sender cannot update SFM 
due to the absence of receiver feedback. Consequently, it does not update the G-IDNC graph Q g [29], 
and only sends the minimum clique partition solution of Q g , which, according to Theorem 3, has the 
same size as the minimum clique partition solution of S-IDNC. We thus have the following corollary: 

Corollary 2. G-IDNC cannot reduce the length of a semi-online transmission round compared to S- 
IDNC. 


V. S-IDNC Coding Algorithms 

The two transmission schemes we proposed in the last section require finding S m , an S-IDNC solution 

that contains U s maximal coding sets. In this section, we develop its optimal and heuristic algorithms. 

A. Optimal S-IDNC coding Algorithm 

Our optimal S-IDNC coding algorithm finds S m in two steps: 

Step-1 Find all the maximal coding sets (maximal cliques): This problem is NP-hard in graph theory. 
We apply an exponential algorithm, called Bron-Kerbosch (B-K) algorithm [42]. The group of 
all maximal cliques is denoted by A. 

Step-2 Find S m from A: We propose a branching algorithm in Algorithm 1. The intuition behind this 
algorithm is that, if a data packet p k belongs to d k maximal coding sets in A, then one of these 
dk maximal coding sets must be included in S m for the completeness of S m . In the extreme case 
where d k = 1, the sole maximal coding set that contains p fc must be included in S m . Below is 
an example of Algorithm 1. 

Example 4. Consider the graph model in Fig. 4. In Step-1, we find all the maximal cliques: A = 

{(P1,P3). (P2,P3,P5), (P3,P4), (P4,Pe). (Ps, Pe) }- Then in Step-2: 

1) Initially, S = 0, S — A \ S = A, and the set of data packets not included in S is V = 
{pi, p 2 , Ps, P4, p,5, p 6 }. Since pi is only included in (p 1; p 3 ) and p 2 is only included in (p 2 , P3, Ps), 
these two coding sets must be added to S. Hence, S = {(pi,Ps), (P2,P3,Ps)} after the first two 


iterations; 
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Maximal cliques: 
(Pl,P 3 ) 
(P 2 ,P 3 ,P 5 ) 
(P 3 ,P 4 ) 

(P 4 ,Pe) 

(P 5 ,Pe) 

Step-1 


(pi > P3) 
l 

(P 2 ,P 3 ,P 5 ) 

/ \ 

(P 3 ,P 4 )(P 4 ,Pe) 

Step-2 


Fig. 4. An example of the 2-step optimal S-IDNC coding algorithm. 


2) The set of data packets not included in S is V = {p i. p (j }, and the remaining maximal coding 
sets are S = A \ <S={(p 3 , p 4 ), (p 4 , Pe), (P5,P6)}}- Since P 4 has a diversity of 2 under S due to 
(p 3 ,p 4 ) and (p 4 ,p 6 ), we branch S into two successors: Si = {(pi, P3),(P2, P3, P5MP4, P5)} an d 
S -2 = {(pi ■ p. 3 ),(ji 2 • P:t- P r ,),(p 4 . Pfj) }- Since S 2 contains all data packets and there are no other 
branching opportunities, the algorithm stops and outputs S 2 as S rn , 

B-K algorithm and Algorithm 1 constitute our optimal S-IDNC coding algorithm. It produces all the 
valid S m . Among these solutions, we can choose the one that optimizes a secondary criteria, such as 
the one offering the smallest D s , or the largest If, calculated using (11). 

B. Hybrid S-IDNC Coding Algorithm 

Algorithm 1 is memory demanding, because the number of candidate solutions grows exponentially 
with branching. Thus, we propose a heuristic alternative to it. The idea is to iteratively maximize the 
number of data packets included in S m . The algorithm is given in Algorithm 2. 

B-K algorithm and Algorithm 2 constitute our hybrid S-IDNC coding algorithm. It produces only 
one S-IDNC solution, with no guarantee on the solution size. It is still computational expensive due to 
B-K algorithm. Thus, we develop a polynomial time heuristic S-IDNC coding algorithm next. 

C. Heuristic S-IDNC coding Algorithm 

Algorithm 3 is a simple algorithm that heuristically finds the maximum (the largest maximal) clique 
of a graph. The intuition behind this algorithm is that, a vertex is very likely to be in the maximum 
clique if it is incident by the largest number of edges. Variations of this algorithm have been developed 
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Algorithm 1 Optimal S-IDNC solution search 

l: input: the group of all maximal coding sets, A; 

2 : initialization: a set B of solutions, B only contains an empty solution S = 0. A counter u — 1; 

3: while no solution in B contains all data packets, do 

4: while there is a solution in B with size u — 1, do 

5: Denote this solution by S = {.Mi, ■ ■ ■ Denote the data packets included in S by 

V = Ur=i{M:} and all data packets not included in S by V = Vk \ V. Also denote the 
maximal coding sets not included in S by S = A \ S\ 

6: Pick from V the data packet p that has the smallest diversity d under S. Denote the d coding 

sets which contain p by JA' yi • • • , M d ; 

7: Branch S into d new solutions, <S[, • • • ,S d . Then, add M.[, ■ ■ ■ ,A4' d to these solutions, 

respectively. The sizes of the new solutions are u; 

8 : end while 

9: U = U + 1; 

to: end while 

it: Output the solutions in B that contain all data packets. 


in the literature [12], [13], [24]. But this algorithm has not been applied to finding a complete S-IDNC 
solution, and its computational complexity has not been identified yet. 

The computational complexity of Algorithm 3 is polynomial in the number of data packets K. The 
highest computational cost occurs when the input graph is complete, i.e., when all vertices are connected 
to each other. In this case, only one vertex will be removed in each iteration. Thus, the number of 
remaining vertices in iteration-/' will be K — i, V/i G [0, K — 1], Then, to find the vertex with the largest 
number of incident edges, we need K — i comparisons. The total computational cost is thus in the order 
of Eto 1 K — i = K(K — l)/2. Hence, the computational complexity of Algorithm 3 is at most 0(K 2 ). 

We apply Algorithm 3 to iteratively find S m in Algorithm 4. In each iteration, we find a clique using 
Algorithm 3, maximize it by adding more vertices to it whenever possible, and then remove it from the 
S-IDNC graph. This will increase the diversities of the added vertices/packets. Below is an example: 
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Algorithm 2 Hybrid S-IDNC solution search 
l: input: the group of all maximal coding sets, A; 

2 : initialization: an empty solution S = 0, a counter u — 1 , packet set V = Vk\ 

3 : while S does not contain all data packets, do 

4: find the coding set M in A that contains the largest number of data packets in V; 

5: Add M to S and remove data packets in M from V; 

6 : U = U + 1 ; 

7 : end while 

8: Output the solution S. 

Algorithm 3 Heuristic maximum clique search 
i: input: graph Q(V,£)-, 

2 : initialization: an empty vertex set Vk eep ; 

3: while Q is not empty do 

4: add to Vkeep the vertex v which has the largest number of edges incident to it; 

5: update Q by deleting v and all the vertices not connected to v ( These vertices can be ignored 

because they cannot be part of the target clique, which contains v); 

6 : end while 

7: vertices in Vk eep are pair-wise connected, and no other vertices can be added to them. Hence, Vk eep 
is a maximal clique. 


Example 5. Consider the graph Q s in Fig. 1(b). In the first two iterations, the algorithm will choose 
Mi = (pi, P2, Pr) and M 2 = (P3, Pe), respectively. In the third iteration, V COV ered = {pi, P2, P3, Pi, P6} 
and the algorithm can only choose M 3 = (p.5). Among all the data packets in V coverec i, P2 can be added to 
M 3 . Thus M 3 = {p2, Ps}. The algorithm then stops and outputs S m = {(pi, P2, P4), (p.3, P 6 ), (P2, Ps)}- 

In conclusion, we proposed an optimal algorithm that finds S m , as well as its hybrid and heuristic 
alternatives. The output S m is used as the S-IDNC solution for the semi-online transmission scheme. If 
fully-online transmission scheme is applied, the transmitted coding set Mf is chosen from S m . 
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Algorithm 4 Heuristic S-IDNC solution search 
i: input: a graph Q{V,£); 

2 : initialization: an empty vertex set V COV ered, a working graph Q w = Q. and a counter i = 0; 

3: While V COV ered ^ V do 

4 : Find the maximum clique in Q w using Algorithm 3. Denote it by Mi ; 

5: Find the vertices in V CO vered which are connected to M t . Denote their set by V, ( They are the 

candidate vertices that could be added to Mi.); 

6 : Generate a subgraph of Q whose vertex set is V,. Denoted this subgraph by £,); 

7 : Find the maximum clique in Q' % using Algorithm 1, denoted it by M' {All vertices in M[ are 

connected to each other and thus can all be added to Mi.); 

8 : Update Vcovered by adding vertices in M t into it; 

9: Update Q w by removing M, from it; 

10 : Update Mi as M, = Mi U M\ {The new clique is at least as large as the old one, and thus 

provides higher packet diversity); 
it: i — i + 1; 

12 : end while 


VI. Simulations 

In this section, we present the simulated throughput and decoding delay performance of S-IDNC 
(abbreviated as S- in the figures) under different scenarios, including under full- and semi-online 
transmission schemes, and under the use of optimal, hybrid, and heuristic coding algorithms (abbreviated 
in the figures as Opt., Hybr., and Heur., respectively). The packet block size is K = 15. The number 
of receivers N is chosen between 5 and 40. The packet erasures are i.i.d. among the channels between 
the sender and the receivers probability with a probability of P e = 0.2. 

We also compare S-IDNC with RLNC and G-IDNC. For RLNC, we assume a sufficiently large 
finite field, so that its throughput is almost surely optimal and serves as a benchmark. For G-IDNC, 
although its best performance is at least as good as S-IDNC (as we have explained in Section III-C), this 


advantage will not necessarily be reflected in our simulation results. This is because there has not been 
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any optimal G-IDNC algorithm. Instead, we apply a heuristic algorithm (abbreviated as Heur. G- in 
the figures) proposed in [24], which aims at minimizing the block completion time. This aim coincides 
with our optimization priorities for S-IDNC in Remark 2, namely, to minimize the block completion 
time first. 

We conduct three sets of simulations. The first set compares the performance limits of the three 
techniques. The results are presented in Fig. 5. Here for RLNC, its absolute minimum block completion 
time is equal to the size of the largest Wants set of the receivers. This number cannot be further 
reduced by any means, because otherwise the most demanding receivers cannot decode all its wanted 
data packets. The second (resp. third) set of simulations compares the throughput and decoding delay 
performance under fully-online (resp. semi-online) transmission scheme. The results are presented in 
Fig. 6 (resp. Fig. 7). We note that the performance of RLNC is the same under both schemes, because 
RLNC is feedback-free. 

Our observations on S-IDNC are as follows: 

. The absolute minimum block completion time of S-IDNC increases almost linearly with N. This 
result matches Corollary 1; 

. The fully-online transmission scheme always provides better throughput and decoding delay per¬ 
formance than the semi-online one; 

. The optimal coding algorithm always provides better throughput performance than its hybrid and 
heuristic alternatives. This result verifies our choice of S m for throughput improvement, because 
only the optimal coding algorithm can always produce S m , which has \S rn \ = U s ; 

• However, the optimal coding algorithm does not necessarily minimize the APDD. For example, in 
Fig. 6(b), the hybrid algorithm provides smaller APDD than the optimal one under the fully-online 
transmission scheme; 

• The performance gap between the optimal and hybrid algorithms is always marginal, and is much 
smaller than their gap with the heuristic one. Hence, the hybrid algorithm strikes a good balance 
between performance and computational load. 


A cross comparison of RLNC, S-, and G-IDNC shows that: 
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(a) Block completion time 


(b) Average packet decoding delay 


Fig. 5. The throughput and decoding delay performance limits of S- and G-IDNC, as well as RLNC. 




(a) Block completion time (b) Average packet decoding delay 

Fig. 6. The throughput and decoding delay performance of fully-online transmission scheme when different coding algorithms are applied. 
It is compared with the performance of heuristic fully-online G-IDNC and RLNC. 


. The throughput of RLNC is always the best. The throughput of S-IDNC is very close to RLNC 
when the number of receivers is small. Their gap increases with N ; 

• In general, the APDD of both S- and G-IDNC is better than RLNC. This advantage only vanishes 
when the block completion time of S- and G-IDNC becomes much larger than RLNC, which takes 
place when N is much larger than K\ 

• There is no clear winner between the performance of heuristic G-IDNC and optimal S-IDNC. We 
can expect that G-IDNC will outperform S-IDNC if its optimal coding algorithm is developed. 

In summary, our simulations verified our theorems, propositions, and algorithms. They also demon¬ 
strated that, if we are concerned with both throughput and decoding delay performance, S-IDNC is a 
good alternative to RLNC when the number of receivers is not too large. 
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(a) Block completion time (b) Average packet decoding delay 

Fig. 7. The throughput and decoding delay performance of semi-online transmission scheme when different coding algorithms are applied. 
It is compared with the performance of heuristic semi-online G-IDNC and RLNC. 


VII. Conclusion 

In this paper, we studied the throughput and decoding delay performance of S-IDNC in broadcasting 
a block of data packets to wireless receivers under packet erasures. By using a random graph model, 
we showed that the throughput of S-IDNC decreases linearly with increasing number of receivers. By 
introducing the concept of perfect S-IDNC solution, we proved the NP-hardness of minimizing the 
average packet decoding delay. We also proposed two upper bounds on the throughput and decoding 
delay limits of S-IDNC. We considered two transmission schemes that requires fully- and semi- online 
feedback frequencies, respectively. By applying stochastic shortest path method, we showed that it is 
intractable to make optimal coding decisions in the presence of random packet erasures. We then used 
heuristic objective functions to determine the preferred coded packet(s) to send. We then developed the 
optimal S-IDNC coding algorithm and its complexity-reduced heuristics. We also compared S-IDNC 
with G-IDNC by proving the equivalence between the chromatic number of the complementary S-IDNC 
and G-IDNC graphs. We used this equivalence to show that G-IDNC can outperform S-IDNC when 
there are not packet erasures, but this is not always true when there are packet erasures. 

Our work provides news understandings of S-IDNC. It will facilitate the extension of S-IDNC to 
applications in other network settings, such as cooperative data exchange and distributed data storage. We 
are also interested in designing approximation and heuristic algorithms for decoding delay minimization. 
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Appendix A 
Proof of Theorem 3 

Theorem 3 requires the proof of x(@s) — x(G g )- Since every S-IDNC solution is also a G-IDNC 
solution, but a G-IDNC solution is not necessarily an S-IDNC solution, we have U s ^ U g , and thus 
X(Q S ) ^ x(G g )- Hence, here we only need to prove that \{Q S ) ^ x{Q 9 ). 

We first introduce the concept of affiliated S-IDNC graph Q as of a G-IDNC graph Q g , which is 
construct as follows. Given Q g that involves K data packets and N receivers, we generate a graph Q as 
with K vertices, each representing a data packet. We then connect v % and v 3 in Q as if for every pair 
of {m,n} e [1, A r ], v urn and v :j n are connected upon their existence in Q g . In other words, we claim 
that pi and p ; do not conflict if every vertex that represents p, in Q g is connected to every vertex that 
represents p ? in Q g . 

Given an SFM A, we can easily show that its S-IDNC graph Q s is the same as the affiliated S-IDNC 
graph Q as of its G-IDNC graph Q g . Hence, our task becomes to prove that x.(Gas) Q x(G g ), where 
x(Qas ) = U s . This statement is true if the following property is true: 

Property 6. After removing any clique M g from Q g , the chromatic number of the affiliated S-IDNC 
graph Q as is reduced by at most one. 

Since Q g is nonempty as long as Q as is nonempty, this property indicates that any clique partition 
solution of Q g must have a size of at least x(Gas), which will prove that x(Gas) ^ x(Sg)- Property 6 
can be proved through induction: 

1) If M g does not contain any conflicting data packets in Q as , then x(G as ) is reduced by at most one; 

2) If M g contains one pair of conflicting data packets in Q as , then x{Ss) is reduced by at most one; 

3) If M g already contains m pairs of conflicting data packets in Q as , then modifying M g to contain 
one more pair of conflicting data packets in Q as cannot further reduce x{Gas)- 

The first statement is self-evident, because the set of data packets included in such M g is a clique 
of Q as . By removing it, x(Oas ) can be reduced by at most one. 

To prove the second statement, without loss of generality we assume that the pair of conflicting data 
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packets is (pi, p 2 ). Then the set of data packets included in M g takes a form of {AT,, pi, p 2 }, where 
M s is the set of pair-wise non-conflicting data packets, and thus is a clique of Q as . Since pi conflicts 
with p 2 , there exists at least one pair of unconnected vertices in Q g that represent pi and p 2 . This pair 
is not included in M g , and thus is kept after removing M g from Q g . Hence, in the updated affiliated 
S-IDNC graph Q' as , V\ and v-> exist, and are unconnected. Let the chromatic number of Q' as be U 1 , 
then the minimum clique partition of Q' as takes a form of {M\. ■ ■ ■ , Mu'}, which keeps pi and p 2 in 
different coding sets. Then, since M s is a clique of Q as , {M s , M i, • • • , Mu 1 } is a partition of Q as with 
a size of U 1 +1. Thus, U 1 ^ x{Sas) — 1, implying that x{Gs) is reduced by at most one after removing 
M g from Q g . 

The proof of the third statement is similar to the second one, and thus is omitted here. According 
to the above three statements, no matter how many conflicting data packets are included in M g , after 
removing M g from Q g , the chromatic number of the affiliated S-IDNC graph Q as is reduced by at most 
one. Therefore, x{G g ) ^ x{Sas)- Since Q as is the same as Q s , we have x{G g ) ^ x{G s ) and Theorem 3 
is proved. 
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